Beating the Perils of Non-Convexity: Guaranteed Training of Neural Networks using Tensor Methods

نویسندگان

  • Majid Janzamin
  • Hanie Sedghi
  • Anima Anandkumar
چکیده

Training neural networks is a challenging non-convex optimization problem, and backpropagation or gradient descent can get stuck in spurious local optima. We propose a novel algorithm based on tensor decomposition for training a two-layer neural network. We prove efficient risk bounds for our proposed method, with a polynomial sample complexity in the relevant parameters, such as input dimension and number of neurons. While learning arbitrary target functions is NP-hard, we provide transparent conditions on the function and the input for generalizability. Our training method is based on tensor decomposition, which provably converges to the global optimum, under a set of mild non-degeneracy conditions. It consists of simple embarrassingly parallel linear and multi-linear operations, and is competitive with standard stochastic gradient descent (SGD), in terms of computational complexity. Thus, we have a computationally efficient method with guaranteed risk bounds for training neural networks with general non-linear activations.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Decentralized Adaptive Control of Large-Scale Non-affine Nonlinear Time-Delay Systems Using Neural Networks

In this paper, a decentralized adaptive neural controller is proposed for a class of large-scale nonlinear systems with unknown nonlinear, non-affine subsystems and unknown nonlinear time-delay interconnections. The stability of the closed loop system is guaranteed through Lyapunov-Krasovskii stability analysis. Simulation results are provided to show the effectiveness of the proposed approache...

متن کامل

The relationship between Neural Networks and DEA-R (Case Study: Companies Stock Exchange)

   Evaluate the performance of companies on the Stock Exchange using non-parametric methods is very important. DEA and DEA-R with the strategies for piecewise linear frontier production function and use of available data, assess the stock company. In this study, using a neural network algorithm DEA and DEA-R is suggested to classify the first companies in the stock exchange; Secondly, using the...

متن کامل

Traffic Signal Prediction Using Elman Neural Network and Particle Swarm Optimization

Prediction of traffic is very crucial for its management. Because of human involvement in the generation of this phenomenon, traffic signal is normally accompanied by noise and high levels of non-stationarity. Therefore, traffic signal prediction as one of the important subjects of study has attracted researchers’ interests. In this study, a combinatorial approach is proposed for traffic signal...

متن کامل

Estimation of Soil Infiltration in Agricultural and Pasture Lands using Artificial Neural Networks and Multiple Regressions

Common methods to determine the soil infiltration need extensive time and are expensive. However, the existence of non-linear behaviors in soil infiltration makes it difficult to be modeled. With regards to the difficulties of direct measurement of soil infiltration, the use of indirect methods toestimate this parameter has received attention in recent years. Despite the existence of various th...

متن کامل

Designing an expert system for differential diagnosis of β-Thalassemia minor and Iron-Deficiency anemia using neural network

Introduction: Artificial neural networks are a type of systems that use very complex technologies and non-algorithmic solutions for problem solving. These characteristics make them suitable for various medical applications. This study set out to investigate the application of artificial neural networks for differential diagnosis of thalassemia minor and iron-deficiency anemia. Methods: It is...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015